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Abstract. The chromatic threshold S X (H) of a graph H is the infimum of d > such 
that there exists C — C (H, d) for which every H-iree graph G with minimum degree at 
least d\G\ satisfies x(G) < C. We prove that 
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for every graph H with x(H) = r ^ 3. We moreover characterise the graphs H with a 
given chromatic threshold, and thus determine S X (H) for every graph H. This answers 
a question of Erdos and Simonovits [Discrete Math. 5 (1973), 323-334], and confirms a 
conjecture of Luczak and Thomassc [preprint (2010), 18pp]. 



1. Introduction 

Two central problems in Graph Theory involve understanding the structure of graphs 
which avoid certain subgraphs, and bounding the chromatic number of graphs in a given 
family. For more than sixty years, since Zykov [42] and Tutte [5] first constructed triangle- 
free graphs with arbitrarily large chromatic number, the interplay between these two 
problems has been an important area of study. The generalisation of Zykov's result, by 
Erdos [12], to if- free graphs (for any non-acyclic H), was one of the first applications of 
the probabilistic method in combinatorics. 

In 1973, Erdos and Simonovits [14] asked whether such constructions are still possible 
if one insists that the graph should have large minimum degree. As a way of investigating 
their problem, they implicitly defined what is now known as the chromatic threshold of a 
graph H as follows (see [14, Section 4]): 

S X (H) := inf |d : 3 C = C(H, d) such that if G is a graph on n vertices, 

with 5(G) ^ dn and H % G, then X (G) < c}- 
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That is, for d < S X (H) the chromatic number of if- free graphs with minimum degree dn 
may be arbitrarily large, while for d > S X (H) it is necessarily bounded. In this paper we 
shall determine 5 X (H) for every graph H, and thus completely solve the problem of Erdos 
and Simonovits. 

The chromatic threshold has been most extensively investigated for the triangle H = A3, 
where in fact much more is known. Erdos and Simonovits [14] conjectured that S X (K 3 ) = |, 
which was proven in 2002 by Thomassen [39], and that moreover if G is triangle-free and 
5(G) > n/3 then ^ 3. This stronger conjecture was disproved by Haggkvist [18], 

who found a (10n/29)-regular graph with chromatic number four. However, Brandt and 
Thomasse [8] recently showed that the conjecture holds with x(G) ^ 3 replaced by x(G) ^ 
4. Hence the situation is now well-understood for triangle-free graphs G (see [3, 7, 8, 9, 
18, 20, 30]), and can be summarised as follows: 
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For bipartite graphs H, it follows trivially from the K6vari-S6s-Turan Theorem [26] that 
S X (H) = 0, and for larger cliques, Goddard and Lyle [16] determined the chromatic thresh- 
old, proving that S x (K r ) = |^5§ f° r every r ^ 3. Erdos and Simonovits also conjectured 
that 8 x (Cs) = 0, which was proven (and generalised to all odd cycles) by Thomassen [40]. 

For graphs other than cliques and odd cycles, very little was known until the recent 
work of Lyle [31] and the breakthrough of Luczak and Thomasse [30] who introduced a 
new technique which allows the study of S X (H) for more general classes of graphs. In order 
to motivate their results, let us summarise what was known previously for 3-chromatic 
graphs H . We have seen that there are such graphs with chromatic threshold (the odd 
cycle C 5 ), and chromatic threshold | (the triangle). A folklore result is that there are 
also 3-chromatic graphs with chromatic threshold |, such as the octahedron K 2 ^,2- Indeed, 
given a graph H with x(H) = r ^ 3, the decomposition family Ai(H) of H is the set of 
bipartite graphs which are obtained from H by deleting r — 2 colour classes in some r- 
colouring of H. Observe that K 2t 2,2 has the property that its decomposition family contains 
no forests (in fact, .M (A'2^,2) = {£4})- It is not difficult to show that whenever there are 
no forests in Ai(H), the graph H has chromatic threshold | (see Proposition 5). 

Thus it remains to consider those 3-chromatic graphs whose decomposition family does 
contain a forest; in other words, graphs which admit a partition into a forest and an 
independent set (such as all odd cycles). Lyle [31] proved that these graphs have chromatic 
threshold strictly smaller than |. In fact, as we shall show, they have chromatic threshold 
at most I (see Theorem 2). Luczak and Thomasse [30] described a large sub-family of these 
graphs with chromatic threshold strictly smaller than |. More precisely they considered 
triangle-free graphs which admit a partition into a (not necessarily perfect) matching and 
an independent set; they called a graph (such as C5) near-bipartite if it is of this form, and 
proved that if H is near-bipartite then 5 X (H) = 0. 
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Figure 1. An illustration of near-acyclic graphs with partition into an in- 
dependent set S and a forest F consisting of two trees 7\ and T2. 

However, Luczak and Thomasse did not believe that the near-bipartite graphs are the 
only graphs with chromatic threshold zero. Generalising near-bipartite graphs, they de- 
fined H to be near-acyclic if x{H) = 3 and H admits a partition into a forest F and an 
independent set S such that every odd cycle of H meets S in at least two vertices. Equiva- 
lently, for each tree T in F with colour classes V\(T) and V^T), there is no vertex of S with 
neighbours in both Vi(T) and V^(T) (see also Figure 1). Observe that the near-bipartite 
graphs are precisely the near-acyclic graphs in which every tree is a single edge or vertex. 
The first graph in Figure 2 is near-acyclic (as illustrated by the highlighted forest), but 
has no matching in its decomposition family and thus is not near-bipartite. 

Lyle [31] proved that S X (H) = for a sub-family of near-acyclic graphs H which are not 
necessarily near-bipartite, and Luczak and Thomasse gave a construction (see Section 4) 
showing that every graph which is not near-acyclic has chromatic threshold at least |. 
They made the following conjecture. 

Conjecture 1 (Luczak and Thomasse [30]). Let H be a graph with x{H) = 3. Then 
S X (H) = if and only if H is near-acyclic. 

We shall prove Conjecture 1, and moreover determine 5 X (H) for every graph H. In this 
theorem, we use the following generalisation of near-acyclic graphs. We call a graph H 
r -near- acyclic if x{H) = r ^ 3, and there exist r — 3 independent sets in H whose removal 
yields a near-acyclic graph. Note in particular that if H is r-near-acyclic, then there is a 
forest in A4(H). Our main theorem is as follows. 

Theorem 2. Let H be a graph with x{H) = r ^ 3. Then 

r — 3 2r — 5 r — 2 1 
r - 2' 2r - 3' r - 1 J ' 

Moreover, S X (H) ^ if an d on ^V if H has a forest in its decomposition family, and 
S X (H) = if an d only if H is r-near-acyclic. 

For example, the dodecahedron is 3-chromatic and near-bipartite, hence it has chromatic 
threshold (Figure 2 shows the dodecahedron together with a corresponding matching). 



5 X (H) G 
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Figure 2. A near-acyclic graph that is not near-bipartite, the dodecahe- 
dron, and the icosahedron. 

The icosahedron on the other hand has chromatic threshold I because it is four-chromatic 
and has a forest in its decomposition family (a partition of the icosahedron into this forest 
and two independent sets is also displayed in Figure 2), but is not 4-near-acyclic. 
For easier reference, given H with x(H) = r, we define 

\{H):= 2 -^- and tt(H) := ^ . 
r — 2 2r — 3 r — 1 

Observe that tt(H) is precisely the Turan density of H, and therefore the Erdos-Stone 
Theorem [15] yields S X (H) «C ft(H) for all H. Furthermore, the constructions giving the 
lower bounds in Theorem 2 are straightforward extensions of those given in [30, 31]. It 
follows that our main challenge is to prove that S X (H) ^ when M.(H) contains a 

forest, and that S X (H) ^ 0(H) when H is r- near-acyclic. 

The recent results of both Lyle [31] and Luczak and Thomasse [30] contain important new 
techniques, which we re-use and extend here. Most significantly, Luczak and Thomasse [30] 
introduced a concept which they termed paired VC- dimension, which is based on the 
classical Vapnik-Cervonenkis dimension of a set-system [41]. Our proof of Conjecture 1 
relies on an extension of this technique (see Section 6), together with a new embedding 
lemma (see Section 5) which allows us to find a copy of H in sufficiently many 'well- 
structured' copies of the 'Zykov graph', which is a universal near-bipartite graph. 

Lyle [31] introduced a novel graph partitioning method based on the celebrated Sze- 
meredi Regularity Lemma. We shall use his partition in Section 3, together with averaging 
arguments similar to those in [1], to prove that S X (H) ^ A(iJ) for any graph H such that 
M.(H) contains a forest. In Section 7 we shall combine and extend both techniques in 
order to generalise Conjecture 1 to arbitrary r ^ 3. 

Organisation. In Section 2 we state the Regularity Lemma in the form in which we 
shall use it, together with some auxiliary tools, and provide some notes on notation. In 
Section 3 we prove that S X (H) ^ A(i7) for any graph H such that M.(H) contains a 
forest. In addition we give a construction which shows that if Ai(H) does not contain a 
forest, then S X (H) ^ n(H). In Section 4 we provide a construction (using the Borsuk- 
Ulam Theorem) which shows that for graphs H which are not r- near-acyclic, we have 
S X (H) ^ A(if). In Section 5 we introduce a generalisation of the class of Zykov graphs, a 
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class of universal near-bipartite graphs which were used in [30]. We show that for every 
near-acyclic graph H , if G contains a suitably well-structured collection of Zykov graphs, 
then G contains H. Complementing this, in Section 6 we refine Luczak and Thomasse's 
paired VC-dimension argument to show that every graph with linear minimum degree and 
sufficiently large chromatic number indeed contains such a well-structured collection of 
Zykov graphs. Completing the proof of Theorem 2, in Section 7 we give a construction 
which shows that for any r-chromatic graph H we have S X (H) ^ &(H), and combine 
the results of Sections 5 and 6 with the Regularity Lemma in order to show that for r- 
near-acyclic graphs H, we have S X (H) ^ &{H). Finally, in Section 8 we conclude with a 
collection of open problems. 

2. Tools and the Regularity Lemma 

In this section we shall state some of the tools used in the proof of Theorem 2. In par- 
ticular, we shall recall the Szemeredi Regularity Lemma, which is one of the most powerful 
and important results in Graph Theory. Introduced in the 1970s by Szemeredi [38] in order 
to prove that sets of positive density in the integers contain arbitrarily long arithmetic pro- 
gressions (a result known as Szemeredi's Theorem [37]), it says (roughly) that any graph 
can be approximated well by a bounded number of 'quasi-random' graphs. The lemma 
has turned out to have an enormous number of applications, and many important exten- 
sions and variations have been proved (see for example [17, 23, 29, 34] and the references 
therein). The reader who is unfamiliar with the Regularity Lemma is encouraged to see 
the excellent surveys [24, 25]. 

We begin by stating the Regularity Lemma in the form in which we shall use it. Let 
(A, B) be a pair of subsets of vertices of a graph G. We write d(A, B) = i~nrjfl , and call 
d(A, B) the density of the pair (A,B). (Here e(A,B) denotes the number of edges with 
one endpoint in A and the other in B.) For each e > 0, we say that (A, B) is e-regular if 
\d(A,B) - d(X,Y)\ < e for every X C A and Y C B with \X\ ^ e\A\ and \Y\ ^ e\B\. 

A partition Vo U V\ U . . . U Vfc of V(G) is said to be an e-regular partition (or sometimes 
a Szemeredi partition of G for e) if \Vq\ ^ en, \V\\ = ... = \Vk\, and all but at most ek 2 of 
the pairs (Vi, Vj) are e-regular. We will often refer to the partition classes V\, . . . , T4 as the 
clusters of the regular partition. In its simplest form, the Regularity Lemma is as follows. 

Szemeredi's Regularity Lemma. For every e > and every ko G N, there exists a 
constant k\ = hi(ko,e) such that the following holds. Every graph G on at least k\ vertices 
has an e-regular partition into k parts, for some ko ^ k ^ k\. 

We shall in fact use a slight extension of the statement above, which follows easily 
from [24, Theorem 1.10] (a proof can be found in, e.g., [27, Proposition 9]). Given < d < 1 
and a pair (A, B) of sets of vertices in a graph G, we say that (A, B) is {e, d) -regular if it 
is e-regular and has density at least d. 

Given an e-regular partition V U V% U • • • U 14 of V(G) and < d < 1, we define a 
graph R, called the reduced graph of the partition, as follows: V(R) = [k] = {1, . . . , k} and 
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ij G E(R) if and only if (Vi,Vj) is an (e, d) -regular pair. We shall occasionally omit the 
partition, and simply say that G has (e, d) -reduced graph R. 

Szemeredi's Regularity Lemma (minimum degree form). Let 0<e<d<5<l, 
and let k G N. There exists a constant ki = ki(k ,£,5,d) such that the following holds. 
Every graph G on n > k\ vertices, with minimum degree 5(G) ^ 5n, has an (e,d)-reduced 
graph R on k vertices, with ko =Sj k ^ k\ and 5(R) ^ (5 — d — ejk. 

Thus the reduced graph R of G 'inherits' the high minimum degree of G. The main 
motivation for the definition of e-regularity is the following so-called 'counting lemma' 
(see [24, Theorem 3.1], for example). 

Counting Lemma. Let G be a graph with (e , d) -reduced graph R whose clusters contain 
m vertices, and suppose that there is a homomorphism 0: H — > R. Then G contains at 
least 

-, l —(d-e\H\) e{H) m\ H \ 

\Knt(H)\ y 1 Iy 

copies of H, each with the property that every vertex x G V(H) lies in the cluster corre- 
sponding to the vertex <p(x) of R. 

Note that since we count unlabelled copies of H, it is necessary to correct for the possi- 
bility that two different maps from H to G may yield the same copy of H (precisely when 
they differ by some automorphism of H). In fact, such an automorphism of H must also 
preserve 0, but dividing by the number of elements of the full automorphism group Aut(i?) 
provides a lower bound which is sufficient for our purposes. We state one more useful fact 
about subpairs of (e, (i)-regular pairs. 

Fact 3. Let (U, W) be an (e, d) -regular pair and U' CU,W CW satisfy \U'\ ^ a\U\ and 
\W'\ ^ a\W\. Then (U',W) is (e/a,d- e) -regular. 

We shall also use the following straightforward and well-known fact several times. 

Fact 4. Let F be a forest andG be a graph on n ^ 1 vertices. If e(G) ^ \F\n, then F C G. 

Proof. Since G has average degree at least 2\F\, it contains a subgraph G' with minimum 
degree at least \F\. It is easy to show that G' contains F; for example, remove a leaf and 
apply induction. □ 

2.1. Notation. We finish this section by describing some of the notation which we shall 
use throughout the paper. Most is standard (see [6], for example); we shall repeat non- 
standard definitions when they are first used. 

For each t G N, let [t] — {1, . . . , t}. We say that we blow up a vertex v G V(G) to size t 
if we replace v by an independent set of size t, and replace each edge containing v by a 
complete bipartite graph. Given disjoint sets X and Y, we shall write K[X, Y] for the edge 
set of the complete bipartite graph on X U Y, that is, the set of all pairs with one end in 
X and the other in Y. We write K s (t) for the complete s-partite graph with t vertices in 
each part: that is, the graph obtained from K s by blowing up each of its vertices to size t. 
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Given a graph G, we write E(G) for the set of edges of G, and e(G) for \E(G)\. We use 
both |Cr| and v(G) to denote the number of vertices of G. Given a set X C V(G), we write 
E(X) for the set of edges of G with both ends in X, and N(X) for the set of common 
neighbours of the vertices in X. If D C E(G), then N(D) denotes the set of common 
neighbours of V(-D), the set of the endpoints of edges in D. (In particular, if e = xy is an 
edge then N(e) = N({x,y}) = N(x) D N(y).) Further, we let C7[f)] denote the subgraph 
of G with vertex set V(-D) and edge set D, and write d(D) for the average degree of G[D] 
and 5(.D) for the minimum degree of C7[-D]. 

A graph G is said to be C -degenerate if there exists an ordering (vi, . . . ,v n ) of V(G) 
such that has at most C neighbours in {vx, . . . , v^} for every 1 ^ k ^ n — 1. Finally, 
if ei, . . . , G E(G), then we shall write e £ for the tuple (ei, . . . , e^). 

3. Graphs with large chromatic threshold 

In this section we shall categorise the graphs H with chromatic threshold greater than 
A (if). First observe that a trivial upper bound on S X (H) is given by the Turdn density 

r|g) = i im gM)^S g ;- 2 , 

1 ' ->•» g) x(n)-r 

since if 5(G) = 5n with 5 > n(H)n then if C G, by the Erdos-Stone Theorem [15]. 
Moreover, it is not hard to prove the following sufficient condition for equality, which can 
be found, for example, in [31]. Recall that Ai(H) denotes the decomposition family of H. 

Proposition 5. Let H be a graph with x{H) = r ^ 3. If A4(H) does not contain a forest, 
then 5 X {H) = ir{H) = 



For each k, I G N, we shall call a graph G a (A;, £)-Erdos graph if it has chromatic number 
at least k, and girth (length of the shortest cycle) at least I. In one of the first applications 
of the probabilistic method, Erdos [12] proved that such graphs exist for every k and I. 

Proof of Proposition 5. Let if be a graph with x(if) = r ^ 3 such that Ai(H) contains 
no forest, let C G N, and let G' be a (C, \H\ + 1)-Erd6s graph; that is, x(G') ^ C and 
girth(G') ^ [if | + 1. Let G be the graph obtained from the complete, balanced (r — 1)- 
partite graph on (r — 1)|G'| vertices by replacing one of its partition classes with G 1 . Then 
5(G) = r ^v(G), H£G, and >C. □ 

We remark that the same construction, with the complete balanced (r — l)-partite graph 
replaced by a complete balanced (r — 2)-partite graph, shows that, whatever the structure 
of H, its chromatic threshold is at least ( see Proposition 35). 

Lyle [31] showed that the condition of Proposition 5 is necessary. 

Theorem 6 (Lyle [31]). If x(H) = r ^ 3, then S X (H) < ir(H) = if and only if the 
decomposition family of H contains a forest. 

In this section we shall strengthen this result by proving that if S X (H) < 7r(if), then it 
is at most A (if). 
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Theorem 7. Let H be a graph with x{H) = r ^ 3. If M.(H) contains a forest, then 

2r - 5 



5 X {H) ^ 



2r - 3 



The proof of Theorem 7 is roughly as follows. Let 7 > 0, and let G be a sufficiently 
large graph with 

For some suitably small e and d, let Vo U . . . U be the partition of V(G) given by the 
minimum degree form of the Szemeredi Regularity Lemma, and R be the (e, d)-reduced 
graph of this partition. Define, for each I C [k], 

Xj := \v G V(G) : iel \N(v) fl K| ^ d]^]}. 

We remark that this partition was used by Lyle [31]. We show that x{G[Xj]) is bounded 
if H $Z G. We distinguish two cases. If |/| ^ (2r — 4)/(2r — 3), then it is straightforward 
to show that R[I] contains a copy of K r ~i, and hence, by the Counting Lemma, that N(x) 
contains 'many' (i.e., a positive density of) copies of K r -i(t) for every x G Xj. We then 
use the pigeonhole principle (see Lemma 9, below), to show that either \Xj\ is bounded, 
or H C G. 

If |/| ^ (2r— 4)/(2r — 3), then set V} = Ujg/ ^j'j an< i observe that every pair x, y G Xj has 
'many' common neighbours in Vj. We use a greedy algorithm (in the form of Lemma 9(a)) 
to conclude that every edge is contained in a positive density of copies of K r . Finally, we 
shall use a counting version of a lemma of Erdos (Lemma 8) together with the pigeonhole 
principle to show that when H (£ G, x{G[Xj]) is bounded (see Lemma 10). 

We begin with some preliminary lemmas. The following lemma from [1] will be an 
important tool in the proof; it is a counting version of a result of Erdos [13]. 

Lemma 8 (Lemma 7 of [1]). For every a > and s, t G N there is an a' = a' (a, s, t) > 
such that the following holds. Let G be a graph on n vertices with at least an s copies of K s . 
Then G contains at least a'n st copies of K s (t). 

We shall also use the following easy lemma, which is just an application of a greedy 
algorithm and the pigeonhole principle. Let G + H denote the graph obtained by taking 
disjoint copies of G and H, and adding a complete bipartite graph between the two. 

Lemma 9. Let a, 5 > and s, t G N ; let F be a forest, and suppose that H C F + K s (t). 
Let G be a graph on n vertices, and X C V(G). 

(a) If 5(G) ^ 5n and 

\X\ ^ (a 1/s s + (l-5)(s-l))n J 

then G[X] contains at least an s copies of K s . 

(b) If G[N(x)~\ contains at least an^ s+1 ^ H ^ copies of K s+ x(\H\) for every x G X, then 
either H C G or \X\ ^ \H\la. 
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Proof. For part (a), we construct copies of K s in G[X] using the following greedy algorithm: 
First choose an arbitrary vertex x± G X, then a vertex x 2 G X in the neighbourhood of X\, 
then X3 G A^(xi)nA^(x2)nX, and so on, until we find x s G X in the common neighbourhood 
of £1, . . . , x s -\. Clearly, G[{xi, . . . ,x s }~\ is a copy of K s . 
Now we simply count: for choosing Xj we have at least 

\X\ - (i - 1)(1 -£)n ^ a 1/s s • n ^ (s\a) ' s ■ n 

possibilities, so in total we have at least s\an s choices. Since the algorithm can construct 
a particular copy of K s at most in s! different ways, we have found at least an s distinct 
i^s-copies in G[X]. 

For part (b), simply observe that, by the pigeonhole principle, there is a copy T of 
K s+ i(\H\) in G such that T C G[iV(£)] for at least a\X\ vertices of X. Since H C 
K S+2 (\H\) this implies that either H C G or < □ 

The following result follows easily from Lemma 8. For a forest F and H C F + K s (t), it 
will enable us to draw conclusions about the chromatic number of an if -free graph which 
contains many i^ s+ 2-copies arranged in a suitable way. 

Lemma 10. For every a > and s,t G N, t/iere exists a' = a' (a, s,t) such that for every 
forest F and every graph H C F + K s (t), the following holds. Let G be an H-free graph 
on n vertices, and let X C V(G) be such that every edge xy G E[G[X\) is contained in at 
least an s copies of K s+2 in G. 

Then G[X] is (2\F\/ a 1 ) -degenerate, and hence x(G[X]) ^ (2\F\/a') + 1. 

Given a subgraph K of G, we say that an edge e = xy of G extends to e + K if 
V{K) ^ N(x, y). We say e extends to a copy of e + K s (t) if e extends to e + K for some 
copy -?T of K s (t) in G. 

Proof of Lemma 10. Let a' = a'(a,s,t) be the constant provided by Lemma 8, let xy G 
E(G[X\) and let G" = G[N(x) f)N(y)]. Then, by our assumption, G' contains at least an s 
copies of K s . By Lemma 8, it follows that G' contains a'n st copies of K s (t), so xy extends 
to at least a'n st copies of e + K s {t) in G. 

Now, let Xi, . . . , x\x\ be an ordering of the vertices of X with the property that X; has 
minimum degree in Gi := G[X \ {x±, . . . , for each 1 ^ i ^ \X\. In order to show 

that G[X] is (2|F|/a')-degenerate, it suffices to prove that 

e{Gi) ^ 

a' 

since then S(Gi) ^ 2|F|/a', as desired. 

Since each edge e G E(Gj) extends to at least a'n st copies of e + K s (t), it follows, by 
the pigeonhole principle, that there is a copy K' of K s {t) and a set C E(Gi) with 
l-Ejl ^ a'e(Gj), such that e extends to e + K' for every e G E^. Let G* be the graph with 
vertex set V{Gi) and edge set E { . Since H % G, it follows that F ^ G*. Thus, by Fact 4, 
we have 

|F| • \Gi\ > e(G*) ^ ^(Gi), 
as required. □ 
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We are ready to prove Theorem 7. We shall apply the minimum degree form of the 
Szemeredi Regularity Lemma, together with the Counting Lemma and Lemmas 9 and 10. 

Proof of Theorem 7. Let F be a forest, let r ^ 3, and let if be a graph with x{H) = r 
and F e M(H). Observe that we have H C F + K r _ 2 {\H\). Let 7 > 0, and let G be an 
if -free graph with 

^ > (£^l + " • 

where n = \G\. We shall show that x{G) is bounded above by some constant C = C(H, 7). 

The first step is to apply the minimum degree form of Szemeredi's Regularity Lemma 
to G, with 

7 r, r 7 (p 

(1) d := — , k := r and £ := min < — , — ; — 

V 7 2' ° \2' 2d + 2\H 

We obtain a partition V(G) = V U V\ U . . . U Vfc, where k ^ k ^ ki = k\{s, d, k ), with 
an (e, (i)-reduced graph i? such that 

We now partition the vertices of V(G) depending upon the collection of the sets Vi to 
which they send 'many' edges. More precisely, define V(G) = Ujcny Xi by setting 

X 7 := |u e |X» n V±\ ^ d\V±\\, 

for each I C [A;]. We claim that < max{Ci,C 2 + 1} for all i C [A;], where 

C\ = Ci(H, 7) and C2 = C^if, 7) are constants defined below. Since the X/ form a 
partition, this implies that x(C) ^ 2 k max{C 1 ,C 2 + 1} ^ 2 fel max{Ci,C 2 + 1} = C(if,7) 
as desired. 

In order to establish this claim we distinguish two cases. 

k. 







m > r 




1 1 \2r 





In this case we shall show that |Xj| ^ C\ (where C\ is a constant defined below, and 
independent of n), and thus trivially x(G[X/]) ^ C\. We first claim that R[I] contains a 
copy of K T _\. Indeed, by our minimum degree condition on R, we have 

6(R[I]) > 8(R)-(k-\I\) > |j| - - 7 ) * > (^| + 7)|/|. 

Thus, by Turan's Theorem (or simply by proceeding greedily), R[I] contains a copy of K r _i, 
as claimed. Let {W±, . . . , W^-i} Q {Vi, . . . , 14} be the set of parts corresponding to this 
copy of K r _i. 

Now let x G X 7 , set W[ = N(x) (1 Wi for each % G [r - 1], and note that \W[\ ^ d\Wi\, 
by the definition of X/. By Fact 3, each pair (IV/, Wj), i 7^ j, is (e/d,d — e)-regular. By 
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the Counting Lemma and (1), it follows that G[N(x)] contains at least 

copies of K r _i(\H\), for some a.\ = ai(H, 7) > 0. 

Thus, by Lemma 9(6) (applied with a = a±, s = r — 2 and X = Xj), we have either 
H C G, a contradiction, or \Xj\ ^ \H\/ai = Ci(H,j), as claimed. 

k. 







1/1 < r 




1 1 \2r 





In this case we shall show that is CVdegenerate (where C2 is defined below and 

independent of n), using Lemma 10. It follows that x(G[X/]) ^ C2 + 1. First, we shall 
show that every edge of G[Xj] is contained in at least 7 r_2 n r_2 copies of K r . 

Let Vi := Uig/^i denote the set of vertices in clusters corresponding to /, and let 
xy G E{G[Xj\). By the definition of Xi, x and y each have at most (d + e)n neighbours 
outside Vi, and thus, since d + e < 7, they each have at least (§^5§ + 7)^ neighbours in Vi. 
It follows that they have at least 2(|^5§ + 7) — \Vj\ common neighbours in Vj. Finally, 
since |/| ^ (ffz|)^) we have \Vi\ ^ {^E^) n i an d thus x and y have at least 



2r - 6 

h 7 I n 

2r -3 

Now, apply Lemma 9(a) with a = 7 r_2 , 5 = j-^f +7, s = r — 2, and X = N(x) C\N(y). 



common neighbours in Vj. 

Now, i 
We have 



2r - 6 . nN , n , A 2r - 5 

7 = ( r - 2)7 + (r - 3) 1 - - 7 



2r - 3 ' ' v '\ 2r - 3 



and so G[N(x) D iV(y)] contains at least 7 r_2 n r ~ 2 copies of ^-2, i.e., every edge of 
is contained in / -y r ~ 2 n r ~ 2 copies of K r . Hence, by Lemma 10 (applied with a = 7 r_2 , 
s = r — 2 and t = \H\), there exists a' = a'(7 r_2 ,r — 2, \H\) > such that is 
2\H\/a' = C 2 -degenerate, and so x(G[X/]) ^ C 2 + 1, as required. □ 



4. Borsuk-Hajnal graphs 

In this section we shall describe the constructions (based on those in [30]) which provide 
the lower bounds on S X (H) in Theorem 2. One of the main building blocks in these 
constructions is a class of graphs which also mark the first (and most famous) application 
of algebraic topology in combinatorics: the Kneser graphs Kn(n, k), which are defined as 
follows. Given n, k G N, let Kn(n, k) have vertex set (^'), the family of k- vertex subsets of 
[n], and let {S, T} be an edge if and only if S and T are disjoint. (For example, Kn(5,2) 
is the well-known Petersen graph.) These graphs were first studied by Kneser [21], who 
conjectured that x(Kn(n, fc)) = n — 2k + 2 for every n and k. This problem stood open 
for 23 years, until it was solved by Lovasz [28], whose proof led eventually to a new area, 
known as Topological Combinatorics (see [32], for example). 
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Hajnal (see [14]) used the Kneser graphs in order to give the first examples of dense 
triangle-free graphs with high chromatic number. Given k,£,m G N such that 2m + k 
divides £, let the Hajnal graph, denoted H(k,£,m), be the graph obtained as follows: 
first take a copy of Kn(2m + k,m), and a complete bipartite graph K^e/, with vertex set 
AU B, where \A\ = 2£, and \B\ = £; next partition A into 2m + k equally sized pieces 
A\, . . . , A2m+k', finally, add an edge between S G V r (Kn(2m + k, m)) and y G Aj whenever 
3 eS. 

The following theorem, which appeared in [14], implies that 5 x (Ks) ^ 1/3. 

Theorem 11 (Hajnal, 1973). For all v > and fceN there exist integers m and £q such 
that, for every £ ^ £q, the Hajnal graph G = H(k,£ J m) satisfies v(G) = 3£ + ( 2m ^[ k ) , 
x(G) ^ k + 2 and 5(G) ^ (| — v)v(G), and is triangle-free. 

In order to generalise Theorem 11 from triangles to arbitrary 3-chromatic graphs which 
are not near-acyclic, Luczak and Thomasse [30] defined the so-called Borsuk-Hajnal graphs. 
We shall next describe their construction. 

The Borsuk graph Bor(fc, e) has vertex set S k , the k- dimensional unit sphere, and edge 
set {xy : Z(x, y) ^ n — e}, where Z(x, y) denotes the angle between the vectors x and y. It 
follows from the Borsuk-Ulam Theorem (see [32], for example) that x(Bor(A;, e)) ^ k + 2 
for any e > 0. 

In order to construct Borsuk-Hajnal graphs from Borsuk graphs, we also need the fol- 
lowing theorem, which follows easily from a result of of Nesetfil and Zhu [33]. 

Theorem 12. Given £ G N and a graph G, there exists a graph G' with girth at least £, 
such that x(G') = x{G), and such that there exists a homomorphism from G' to G. 

Now, given k G N, a set W C S k with \W\ even, and e, 5 > 0, we define the Borsuk- 
Hajnal graph, BH = BH(V4 7 ; k, e, 5), as follows. 

First, let B = Bor(&, e) be the Borsuk graph, and let U C S = V(B) be a finite 
set, with U chosen such that x(B[U]) = k + 2. (This is possible by the de Bruijn-Erdos 
Theorem [11], which states that every infinite graph with chromatic number k' has a finite 
subgraph with chromatic number k'.) Let B' denote the graph given by Theorem 12, 
applied with G = B[U] and £ = k, let <p be the corresponding homomorphism from B' 
to -B[£7], and let U' be the vertex set of B' . 

Let X be a set of size | W|/2, and recall that K [W, X] denotes the edge set of the complete 
bipartite graph with parts M^and X. 

Definition 13 (The Borsuk-Hajnal graph). Define BH = BH(V4 7 ; k,e, 8) to be the graph 
on vertex set U'UWUX, where U', W and X are pairwise disjoint and as described above, 
with the following edges: 

E(B')UK[W 7 X] U {{w,u>} : u G U', w G W and l(<j){u),w) < | - 5^ . 



Observe that x(BH) > x(B[U\) > k. 
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Theorem 14 (Luczak and Thomasse [30]). For every fceN and v > 0, there exist e, 5 > 
and W C S k , such that, setting BH = BH(W; k,e,5), we have 

X(BH) ^ k and 6{BR) ^ Q-vU(BH). 

Moreover every subgraph H C BH < and x(-^0 = 3 is near-acyclic. 

Hence, for any H with x{H) = 3 which is not near-acyclic, we have S X (H) ^ 1/3. 

We shall generalise the Luczak-Thomasse construction further, as follows, to give our 
claimed lower bound on S X (H) for r-chromatic H which are not r- near-acyclic. 

Definition 15 (The r-Borsuk-Hajnal graph). Define BH r (W; k,e,8) to be the graph ob- 
tained from the Borsuk-Hajnal graph BH = BH(W; k,e, 8) by adding r — 3 independent 
sets Yi, . . . , Y r _ 3 of size \Yi\ = . . . = \Y r _ 3 \ = \W\, and the following edges: 

r-3 

|J K[Y u Yj] U \jK[Y u V(Bll)}. 

l^i<j'^7 — 3 i=l 

That is, we add the complete (r — 2)-partite graph on ^(BH) U Y\ U . . . U Y r _ 3 . 

The following result extends Theorem 14 to arbitrary r ^ 3. 

Theorem 16. For every r ^ 3, k G N and z/ > 0, there exist e, 5 > and 14 7 C S k , such 
that, setting BH r = BH^iy; k,e,5), we have 

/Or — 5 \ 

x(BH r ) ^ fc and 5(BH r ) ^ f — — --z/Ju(BH r ). 

Moreover every subgraph H C BH r w«£/i u (if) < and x{H) = r is r -near- acyclic. 
Hence, for any H with x{H) = r which is not r -near- acyclic, we have 8 X (H) f^zf- 

Theorem 16 follows easily from Luczak and Thomasse's argument for Theorem 14; for 
completeness, we shall provide a proof here. 

Proof of Theorem 16. As noted above, we have x(BH r (H / ; k, e, 5)) > k for every choice of 
W, e and 5. For the other properties, we shall choose W randomly, and e, 5 > as follows. 

Let r ^ 3, k G N, and v > be arbitrary, and choose 5 > such that the spherical cap 
of S k (centred around the pole) with polar angle | — <5 covers a (| — |)-fraction of S k . Set 
e = */(2*0, 

n := u(BH r (0;fc,e,5)) = |C/'|, 
and choose u>o sufficiently large such that 

, o . / u 2 w \ 1 /2r-5 \ ^ /2r-5 \/2r-3 \ 

(2) exp^-— j<- and - „j ^ > - uj + u j , 

which is possible because (2r — 3)/2 > 1. Draw u> points uniformly at random from S k , 
call the resulting set W and consider the graph BH r = BH^H 7 ; k, e, 5). 

We show first that, with positive probability, BH r has the desired minimum degree. Let 
Y = Yi U . . . U Y r _3, and recall that BH r has vertex set U'UWUXUY and that \U'\ = u . 
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Claim 17. With positive probability the following holds. For every v G V(BH r ), 

deg BH » > (^^-^J \ W V 

Proof of Claim 17. For v G W U X U Y, it is easy to check that deg BHr (t> ) ^ (^j^) \ W\. 
Moreover, if v G U' then y C N(v) and |V| = (r — 3)|W|. Thus it will suffice to show 
that the following event a holds with positive probability: For every v G U' we have 
deg BHr (^W)^(|-v)|W|. 

To this end observe that, for a given v G U', the value of deg BHr (w, W) is a random 
variable B with distribution Bin(|VK|, | — ~). This follows because was chosen uniformly 
at random from S k , by our choice of S, and since by Definition 13, v is adjacent to w G W if 
and only if Z((f)(v),w) ^ | — 5. Thus, by Chernoff's inequality (see, e.g., [19, Chapter 2]), 

P(degH Bf .(«,W0<(|-^)|W|) ^exp(-^) ? -L. 

By the union bound, the event a holds with positive probability, as required. □ 
Using (2), we have 

and so the desired lower bound on 5(BH r ) follows immediately from the claim. 

Finally, let us show that every subgraph H C BH, r with v(H) < k and x{H) = r is 
r- near-acyclic. We begin by showing that H' := H[U' U W U X] is near-acyclic. 

Observe first that if'[W] is independent, and recall (from Definition 13) that BH r [[/'] 
has girth at least k > v(H). Thus H'[U'UX] is a forest, since all of its edges are contained 
in U'. It therefore suffices to prove the following claim. 

Claim 18. Every odd cycle in H' contains at least two vertices of W. 

Proof of Claim 18. Let C be an odd cycle in H'. (Hence v(C) < k.) If V(C) nl/l 
then | V(C) PI W\ ^ 2 since e(U', X) = and X is independent. Thus we may assume that 
V(C) n X = 0. Similarly, since H'[U'} is a forest, we must have \V{C) D W\ > 1. 

Let P = i>i . . . i> p be a path in [/' with p < k and p even. Recall that 4>(vi), . . . , </>(v p ) 
are vectors from S k such that Z(0(vj), ^ 7r — e for alH G [p — 1]. We shall show 

that N B n r (vi) fl N Biir (v p ) D W = 0, i.e., that 0(t>i) and 0(f p ) do not lie in a common 
spherical cap with angle | — 5. Indeed, we have Z(0(w!), 0(t> 3 )) ^ 2e, and, in general, 
Z(0(wi),0(^ 2 j)) ^ 7r - 2je for every j G [p/2]. Hence ^((f)(vi), <f)(v p )) ^ tt - ke > 2(|— <5), 
and so, by Definition 13, t>i and v p do not have a common neighbour in W, as required. 

This implies that V(C) fl U' cannot be a path on v(C) — 1 vertices and thus we conclude 
\V(C)nW\^2. n 



Finally, note that as H\Yj\ is an independent set for each % G [r — 3], and H' is obtained 
by removing these sets, H is indeed r-near-acyclic, as required. □ 
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5. Zykov graphs 

In this section we shall prove a key result on Zykov graphs (see Definition 19 and Propo- 
sition 22, below), which will be an important tool in our proof of Theorem 2. Let G 1 be 
a graph with connected components C\, . . . , C m , and let G be the graph obtained from G' 
by adding, for each m-tuple u = («]_,..., u m ) £ C\ x ■ ■ ■ x C m , a vertex v u adjacent to 
each Uj. This construction was introduced by Zykov [42] in order to obtain triangle- free 
graphs with high chromatic number. 

We shall use the following slight modification of Zykov's construction. Recall that to 
blow up a vertex v £ V(G) to size t means to replace v by an independent set of size t, and 
replace each edge containing v by a complete bipartite graph, and that K (v, X) denotes 
the set of pairs {vx : x £ X}. 

Definition 19 (Modified Zykov graphs). Let T±, . . . , T% be (disjoint) trees, and let 7} have 
bipartition Aj U Bj. We define Zg{T\, . . . , Tg) to be the graph on vertex set 




and with edge set 

E(Z i (T 1 ,... 7 T t )) := [J (e(Tj) U |J tf(u/,A,-)U |J K(u!,B 3 ) 

j=i \ jeiQW j&QW 

For each r ^ 3 and t £ N, the modified Zykov graph Zj'^Ti, . . . , T^) is the graph obtained 
from Zt(Ti, . . . , T^) by performing the following two operations: 

(a) Add vertices = {u>i, . . . , w r _ 3 }, and all edges with an endpoint in W. 

(b) Blow up each vertex uj with I C [£] and each vertex in W to a set Sj or 5^-, 
respectively, of size t. 

Finally, we shall write Zj' for the modified Zykov graph obtained when each tree T{, i £ [£], 
is a single edge; that is, Z^ = Z r t ' (e±, . . . , eg). 

Note that Zp* has (2^ + r — 3)t + 2£ vertices, and that, in the special case r = 3 and 
t — 1, the graph Z£' coincides with that obtained by Zykov's construction (described 
above) applied to a matching of size £. 

The following observation motivates (and follows immediately from) Definition 19. 

Observation 20. Let x{H) = r. Then H is r -near-acyclic if and only if there exist trees 
Ti, . . . , Tg and t £ N such that H is a subgraph of Zj'*(Ti, . . . , Tg) . 

Proof. Recall that H is r-near-acyclic if and only if there exist r — 2 independent sets 
Ui, . . . , U r _ 3 , W such that H \ (W U |J ■ Uj) is a forest F whose components are trees 
Ti, . . . ,Tg with the following property. For each i £ [£], there is no vertex of W adjacent 
to vertices in both partition classes of Tj. If H C Z r /{Ti, . . . ,Tg) then we can take = 
U/cwi ^ an d Ui, . . . , U r s to be the sets £[, . . . , 5^._ 3 , and so H is r-near-acyclic, as claimed. 
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Conversely if H is r-near- acyclic, then H C Zj' (Ti, . . . , Te), where t = \H\ and Ti,...,Tg 
are the components of the forest F . □ 

It will be convenient for us to provide a compact piece of notation for the adjacencies 
in Zj' . For this purpose, given a graph G and a set Y C V(G), and integers t G N and 
r ^ 3, define Q^' (Y) to be the collection of functions 

S : 2 M U [r - 3] -)• 

It is natural to think of S as a family {Sj : I C [£]} U {5^ : j G [r — 3]} of subsets of Y 
of size t. We say that S G Q r /{Y) is proper if these sets are pairwise disjoint and -E(G) 
contains all edges xy with x G Si U and y G S^/, whenever j 7^ f. We shall write 
J^l^iY) for the collection of proper functions in Q^' l (Y). The idea behind this definition is 
that we will later want to consider a vertex set Y C V(G) and a family of disjoint subsets 
{Si : I C [£]]■ U {Sj : j 6 [r — 3]} of size £ in V that we want to extend to a copy of Zj'*. 

For an ordered pair (x, y) of vertices of G, a function S G J-J' (Y), and z G [£], we write 
(x, y) — K; 5, if Sj C AT (2c, for every j G [r — 3] and 

\J SiC N(x) and |J C N(y) . 

For an edge e = xy G E(G), we write e — >j 5 if either (x, y) — 5 or (y, x) — 5. Recall 
that denotes the £-tuple (e\, . . . , ei), with e° the empty tuple. Define 

e e -+ S ei ->•» 5 for each i G [£] . 

Observe that the graph Zj'* consists of a set of pairwise disjoint edges e\, . . . ,et and an 
5 1 G J c J'*(y) such that e f — >■ S 1 . An advantage of this notation is that we can write e £ — > S 
even if the edges in e e are not pairwise disjoint. This will greatly clarify our proofs. 

In Section 6, we shall show how to find a well-structured set of many copies of Zj'* inside 
a graph with high minimum degree and high chromatic number. The following definition 
(in which we shall make use of the compact notation just defined) makes the concept of 
'well-structured' precise. Recall that, given X C V(G), we write E(X) for the edge set of 
G[AT], and that if D C E(G), then 5(D) denotes the minimum degree of the graph G[-D]. 

Definition 21 ((C, a)-rich in copies of Zj' ). Let X and Y be disjoint vertex sets in a 
graph G, let C G N and a > 0, and let s := (2 l + r - 3)t. We say that (X, Y) is (C, a)-rich 
in copies of Z['* if 

3 £> = D(e°) C Vei G D 3 ^(e 1 ) C Ve 2 G ^(e 1 ) . . . 

. . . Ve*_! G D(e^ 2 ) 3D(e^ 1 ) C Ve £ G ^(e^ 1 ) 

the following properties hold: 

(a) 5(D),5(D(e 1 )),...,5(D(e e ~ 1 )) > C, and 

(b) \{SE (Y) : e e ->S} \ >a\Y\ s . 
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If (X, Y) is (C, a)-rich in copies of Zj'*, then, for each g G [£], define 

(3) V q (X,Y) := |e" G £(X) 9 : ej G D^' -1 ) for each j G [g]}, 

where T>(e°) := D. 

The aim of this section is to prove the following proposition, which says that if some pair 
(X, Y) in G is (C, a)-rich in copies of Zp* (where a > and C is sufficiently large), then 
for any 'small' Ti, . . . , T^ we have Z£' (Ti, . . . , T^) C G, and hence (by Observation 20) G 
is not H-fiee for any r-near-acyclic graph H. 

Proposition 22. Let G be a graph, and let X and Y be disjoint subsets of its vertices. Let 
r,£,t G N, wift r ^ 3 ; and let a > 0. Lei T x , . . . , T(_ be trees, and set C := 2 e+3 a~ 1 Ylt=i 1^1 ■ 
If (X, Y) is (C, a) -rich in copies of Z£ , then Zj' (Ti, . . . , Ti) C G. 

The proof of Proposition 22 uses a double counting argument and proceeds by induction. 
We shall find a set of functions S C J-^^fY) such that, for each S G <S, we can construct 
(one by one) a collection of subgraphs Ei,...,Ei of G[X] with the following properties: 
each subgraph has large average degree, and for any choice e\ G -Ei, . . . , eg G Ti, we have 
e e y S. Recall that this simply says that — >i S for every e, G Ti. 

Let the graph G, disjoint subsets X, Y C V(G), constants a > and r, t G N with 
r ^ 3, and trees Ti, . . . , Ti be fixed for the rest of the section. Set C := 2 i+3 a~ 1 Yll=i 1^1 
and s := (2 e + r — 3)t, and let ^ q ^ £. For our induction hypothesis we use the following 
definition. 

Definition 23 (Good function, (C, a)-dense). A function S G JF^^Y) is {r,£,t,C,a)-good 
for a tuple e q and (X, Y") if there exist sets 

E q+1 , ...,E e C E(X), with d(Ej) ^ 2~^aC for each q + 1 ^ j < 

such that for every e g+ i G T? g +i, • • • > G T^, we have e e — > S. 

When the constants (r, £, t, C, a) and the sets (X, Y) are clear from the context, we shall 
omit them. We shall abbreviate '(r, £, t, C, a)-good for e° and (X, Y)' to '(r, £, t, C, a)-good 
for (X, Y)\ 

The pair (X, Y) is (C, a) -dense in copies of Zj'* if there exist at least 2~^a|Y| s families 
S G J*(Y) which are (r, i, C, a)-good for (X, Y). 

The next lemma constitutes the inductive argument in the proof of Proposition 22. The 
final assertion we shall also need in Section 7. 

Lemma 24. For any ^ q ^ £, if (X, Y) is (C, a)-rich in copies of Z['* ; f/ien 

|{5 G JI'*(Y) : 5 zs good /or e c/ }| ^ 2^a|Y| s 

/or even/ e 9 G T> q (X, Y). 

In particular, if (X, Y) is (C,a)-rich in copies of Zj' , i/ien (X, Y) is (C,a)-dense in 
copies of Zj' . 
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Proof. The proof is by induction on t — q. The base case, q = £, follows immediately from 
the definition of (C, a)-rich. Indeed, a function 5* is good for e e if and only if e e — > S, and 
by Property (b) in Definition 21, we have \{S G J^' (Y) : e e — > S}\ ^ 

So let ^ q < £, assume that the lemma holds for q + 1, and let e q G T> q (X, Y). Set 
P = a2 q ~ l . By Definition 21, there exists a set D(e q ) C E{X) with 5(D(e q )) > C, such 
that e q+1 G 'D (?+ i(X, y) for every e q+ \ G D(e q ). Thus, by the induction hypothesis, 

(4) Ve 9+1 G £>(e 9 ) at least 2fi\Y\ s functions S G ^(F) are good for e q+1 , 

that is, for each such S there exist sets 

E q+2 ,...,E e CE(X), with d{E j )^2~ t aC for each g + 2 ^ j ^ £, 

such that for every e q+ 2 G -Eq+2, • • • , &i G i^, we have e — >■ 5". It is crucial to observe that 
given S", if the edge sets -Eq+2, • • • , Eg have this last property for some e q+ i G D{e q ) with 
e q+i - t ] ien _£; 9+2) . . . ; Eg have this property for all e q+1 G D(e q ) with e 9+1 — > S. 

The S" G J-J'*(Y) that will be good for e 9 are those which are good for many e q+1 . More 
precisely, for each S G J-J' (F), let 

W s := {e^+i G £>(e 9 ) : S is good for e q+1 }, 

and let Z = {S G Jj'^Y) : |W S | > /^(e 9 )!}. By (4) the number of pairs (e q+1 , S) with 
e q+ \ in is at least |D(e 9 )| ■ 2/3|F| s . On the other hand, for every S G Z there are at 
most ^(e 9 )! pairs (e 9+ i, 5*) with e q+ \ in Ws, and for every S G J^' (Y) \ Z, there are (by 
definition of Z) at most j3\D(e q )\ such pairs. Putting these together, we obtain 

\D(e q )\ ■ 2(3\Y\ S <: \D(e q )\\Z\ + /3\D(e q )\\Y\ s 

and hence \Z\ ^ f3\Y\ s . 

We claim that every S G Z is good for e 9 . Indeed, fix S* G Set = Ws, and 

let £"5+2, • • • , Ei be the sets defined above (for any, and thus all, e q+ i G E q+ i), i.e., those 
obtained by the induction hypothesis. Since S G Z we have \Wg\ ^ /3|-D(e 9 )|, so it follows 
from 5(D(e q )) > C that d(E q+1 ) ^ (3C ^ 2- £ aC. Since 5 is good for e q+1 for every 
e 9+1 G .Eg+i, we have — S 1 for every 1 ^ z ^ g + 1, and by the induction hypothesis, we 
have ej — 5* for every G and every q + 2 ^ i ^ £. Thus e £ — ?■ S* for every such e £ , as 
required. Since \Z\ ^ /5 1 V | s , this completes the induction step, and hence the proof of the 
lemma. □ 

Lemma 24 shows that richness in copies of Z r /" implies denseness in copies of Z r / ' . 
Observe that if (X, Y) is dense in copies of Zj' , then in particular there is a function 
S G JY{Y) which is good for (X,Y). The next lemma now shows that in this case we 
have Z%\T u ...,T t ) C G. 

Lemma 25. Let X and Y be disjoint vertex sets in G. Given r,£,t G N, a > 0, and trees 
Ti, . . . , Ti, ifC> 2 e+3 a- 1 YLi \Ti\ and S G J^'\Y) is (r, £, t, C, a)-good for (X, Y), then 
Z r /(T u ...,Ti)CG. 
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Proof. Let S be (r, £, t, C, a)-good for (X, Y). Then there exist sets 

E 1 ,...,E e C E(X), with d{Ej) ^ 2~ e aC for each 1 ^ j ^ £, 

such that for every e\ G . . . , G we have e £ — >■ S. 

For each j G [£] and each edge e G Ej, let e = be such that (x, y) — S 1 , and orient 
the edge e from x to Recall that C ^ 2 f+3 a _1 J^ i=1 |Tj|, and so ^(E^) ^ 8 J2i=i 1^1 f° r 
each j G [£]. For each j G [£], by choosing a maximal bipartite subgraph of £y, and then 
removing at most half the edges, we can find a set Ej C £^ such that, 

(a) E^ is bipartite, with bipartition (Aj,Bj), 

( 5 ) every edge e G -Ej is oriented from Aj to , and 

{c)d{E>)>2Y,U\n- 
Thus, by Fact 4, there exists, for each j G [£], a copy T- of 7} in 

£5 - (t/(tOu...uf(t;_ 1 )), 

since removing a vertex can only decrease the average degree by at most two. These trees, 
together with S, form a copy of Zp*(7i, . . . , T £ ) in G, so we are done. □ 

It is now easy to deduce Proposition 22 from Lemma 24 and Lemma 25. 

Proof of Proposition 22. Let (X, Y) be (C, a)-rich in copies of Z^' , and apply Lemma 24 
to (X, Y) with q = 0. Note that £>oP^ ^) — { e °} consists of the tuple of length zero, and 
let S = {S e y/iY) : S is good for e }. Then \S\ ^ a2~ e \Y\ s , and so in particular S is 
non-empty. Let S E S, and apply Lemma 25 to obtain a copy of Zj'*(Ti, . . . , T^) in G. □ 

6. The paired VC-dimension argument 

In this section we shall modify and extend a technique which was introduced by Luczak 
and Thomasse [30], and used by them to prove Conjecture 1 in the case where H is near- 
bipartite. This technique is based on the concept of paired VC-dimension, which generalises 
the well-known Vapnik-Cervonenkis dimension of a set-system (see [35, 41]). We shall not 
state our proof in the abstract setting of paired VC-dimension, which is more general than 
that which we shall require, but we refer the interested reader to [30] for the definition and 
further details. 

We shall use the paired VC-dimension (or 'booster tree') argument of Luczak and 
Thomasse in order to prove the following result, which may be thought of as a 'count- 
ing version' of Theorem 5 in [30]. The case r = 3 of Theorem 2 will follow as an easy 
consequence of Propositions 26 and 22 (see Section 7). 

Proposition 26. For every £, t G N and d > 0, there exists a > such that, for every 
C G N, there exists C G N such that the following holds. Let G be a graph and let X and 
Y be disjoint subsets ofV(G), such that \N(x) nY\ ^ d\Y\ for every i6l. 
Then either x(G[X\j ^ C' , or (X, Y) is (C,a)-rich in copies of Z\ l . 
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In order to prove Proposition 26, we shall break X up into a bounded number of suitable 
pieces, Xi, . . . , X m , and show that either G[Xj] has bounded chromatic number, or (Xj, Y) 
is (C, a)-rich in copies of Z/*. 

Let d(x,Y) := d({x},Y) = \N(x) n Y\/\Y\ = e({x},Y)/\Y\ be the density of the 
neighbours of x in Y. The key definition, which will allow us to choose the sets Xj, is as 
follows. 

Definition 27 (Boosters). Let G be a graph, let X and Y be disjoint subsets of V(G), 
and let e > 0. We say that x G X is e-boosted by Y' C F if F') ^ (1 + e)d(x, F). 

Now let C,p G N, and let /3 > 0. Let F X U. . .UF p of F be a partition of F, andX U. . .UX p 
be a partition of X. We say that (X , 0), (X 1; Fi), . . . , (X p , F p ) is a (p, C, e, (3)-booster of 
(X,F) if 

(a) G[X ] is C-degenerate. 

(6) Every x G is e-boosted by Yj, for each j G [p]. 
(c) |Fj| ^ /3|F| for every j G [p]. 

We say that a partition {F 1; . . . , F p } of F induces a (p, C, e, /3)-booster of (X, F) if there 
exists a partition X U . . . UX p of X such that (X , 0), (X 1; Fi), . . . , (X p , F p ) is a (p, C, e, 
booster of (X, F). 

We remark that this is slightly different from the definition of a p-booster in Section 5 
of [30], where Condition (a) was replaced by 'G[Xo] is independent', and Condition (c) 
was missing. 

Using Definition 27, we can now state the second key definition. 

Definition 28 (Booster trees). Let C,po G N and 0, e > 0. A (p ,C,e, (3) -booster tree for 
(X, F) is an oriented rooted tree T, whose vertices are pairs (X', Y') such that I'CI and 
Y' C F, and all of whose edges are oriented away from the root, such that the following 
conditions hold: 

(a) The root of T is (X,F). 

(6) No vertex of T has more than p out-neighbours. 

(c) The out-neighbourhood of each non-leaf (X', F') of T forms a (p, C, e, /3)-booster of 

(X', F') for some p ^ p . 
(c/) If (X', F') is a leaf of T, then either G[X'] is C-degenerate, or there does not exist 

a (p, C, e, /3)-booster for (X', F') for any p ^ p . 

A vertex (X', F') of T is called degenerate if it is a leaf of T and G[X'] is C-degenerate. 

The following lemma is immediate from the definitions. 

Lemma 29. Let C, po G N, and /3,e > 0, let G be a graph, and let X and Y be disjoint 
subsets ofV(G). If d(x,Y) ^ d for every x G X, then there exists a (po,C, e, (3) -booster 
tree T for (X, F) such that \T\ is bounded as a function of e, d and po- 

Moreover, if (X', F') is a non- degenerate vertex of T, then d(x, Y') ^ d for all x G X', 
and \Y'\ ^ /3 |r| |F|. 
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Proof. We construct T, with root (X, F), as follows: We simply repeatedly choose a 
(p, C, e, /3)-booster (X , d>), (X 1 ,Y 1 ), . . . , (X p , Y p ) for each leaf (X', Y') of T such that G[X'\ 
is not C-degenerate, until this is no longer possible for any p ^ po- We add to T the vertices 
(X , 0), (Xi, Fi), . . . , (X p , F p ) as out-neighbours of (X', F'). 

By the definition of 'e-boosted' and the construction of T, if (X', Y') is a non-degenerate 
vertex of T at distance t from the root, we have d(x,Y') ^ (1 + e)*d for every x G X', 
and we have |F'| ^ /3*|F| ^ /3' r '|F|. This both establishes that the height h(T) of T is 
bounded in terms of e and d, and that we have d(x, Y') ^ d for every a; G X'. Since T has 
no vertex of out-degree greater than p and /i(T) is bounded by a function of e and d, it 
follows that \T\ is bounded as a function of e, d and po- □ 

The following lemma is the key step in the proof of Proposition 26. 

Lemma 30. Let and d > 0. Lei (3 = (~) and 5 = /3/2. There exists a > snca 

taat tae following holds for every C G N. Lei G be a graph, let X and Y be disjoint subsets 
of V(G), and suppose that \N(x) fl F| ^ d\Y\ for every x G X. 

// taere does not exist a (p,C,e, (3) -booster of (X, F) /or any p ^ 2 £ , taen (X, F) is 
(C, a) -rich in copies of Z e ' t . 

Lemma 30 is proved by repeating a fairly straightforward algorithm £ times, at each step 
q G [£} finding a set D(e q ) as in the definition of (C, a) -richness (see Definition 21). In 
order to make the proof more transparent, we shall state a slightly more technical lemma, 
which is proved by induction on q, and from which Lemma 30 follows immediately. 

The following definition will simplify the statement. It is a slight strengthening of the 
concept of (C, a)-richness in the case r = 3. Recall that e e is just a shorthand for (e\, . . . , e^) 
and e° is the empty tuple. Further, recall the definitions of T^ 1 and e e — > S from Section 5. 

Definition 31 ((C, a, £)-Zykov). Let X and F be disjoint vertex sets in a graph G, let 
C, i G N and a > 0. We say that (X, F) is (C, a, i)-Zykov if 

3D = D(e°) C £(X) Ve x G D 3 -D(e x ) C £(X) Ve 2 G Z^e 1 ) . . . 

. . . Ve*_i G D(e^ 2 ) 3D(e w ) C £(X) V e e G Dfe'" 1 ) 

the following properties hold: 
(a) <5( J D),5(D(e 1 )),...,5( J D(e^ 1 )) > C, and 
(6) 3 S* G jf a|y| (F) such that e e -)• 5. 

We remark that the requirement (&) of Definition 21 that each tuple e e should extend to 
many copies of Zf'* is replaced in this definition by the requirement that e e should extend 
to one much bigger copy of In particular, if (X, F) is (C, a, £)-Zykov, then, for any 

t G N, it is (C, a')-rich in copies of Z e '*, where a' = (j) (this is shown in the proof of 
Lemma 30). 

Lemma 32. Let f G N and d > 0. For (3 = and e = (3/2, the following holds for 
every C G N. Let G be a graph, let X and Y be disjoint subsets ofV(G), and suppose that 
\N(x) fl F| > d\Y\ for every x G X. 
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If there does not exist a (p,C,e, (3) -booster of (X,Y) for any p ^ 2 e , then (X,Y) is 
(C, a q , q)-Zykov for every q G [£], where a q = (j) 9 ■ 

Proof. Let C, £ G N and d > 0, and let (3 = and e = (3/2. Let G and X,Y be as 
described in the statement, and suppose that there does not exist a (p, C, e, /3)-booster of 
(X, Y) for any p ^ 2 e . We proceed by induction. 

We begin with the base case, q = 1. We are required to find a set D = D(e°) C E(X), 
with 5(D) > C, such that, for every e\ = xy G D, there exists S^e 1 ) G J^ ,ai ' y ' SU ch 
that e 1 — > ^(e 1 ); that is, there exist disjoint sets S^e 1 ) and Sj^e 1 ) in Y, both of size 
ai \Y\ = f \Y\, such that ^(e 1 ) C N(x) and S^e 1 ) C N(y). Since there is no (1, C, £,/?)- 
booster of (X, Y), it follows that G[X] is not C-degenerate, and so there exists a subgraph 
G C with <5(G ) > C. We choose £> := £(G ). Now for each e x = xy G D, let 

^(e 1 ) := N(y) n F and ^(e 1 ) := iV(x) fl F. Since ^(e 1 )], ^(e 1 )! ^ d|F| by the 
assumption of the lemma, there exist disjoint sets ^(e 1 ) C A^e 1 ) and 5'{i}(e 1 ) C ^{^(e 1 ) 
with |50(e 1 )|, |5{i}(e 1 )| = f |F|, as required. 

For the induction step, let 1 < q ^ I and assume that the result holds for q — 1. By this 
induction hypothesis 

3D(e°) C E(X)Vei G D(e°) 3 Z^e 1 ) C E(X) Ve 2 G ^(e 1 ) . . . 

. . . Ve g _ 2 G D(e«- 3 ) 3 D(e 9 ~ 2 ) C E(X) Ve,^ G D(e^ 2 ) 

we have 

(a*) 5(D(e )),5(D(e r )),...,5(D(e"- 2 )) > C, and 
(b*) 3S(e^) G jJ^- l|y| (Y) such that e^ 1 -»• S(e^). 
As in Definition 21, set 

V q (X,Y) ■= {e 9 G E(X) g : e,- G ^(e^ 1 ) for each j e [q]\ . 

We shall show that for every e 9-1 G T> q ^i(X, F), there exists a set of edges D(e q ~ 1 ) C 
-E(X), with ^(/^(e 9 " 1 )) > C, such that for every e q G D(e 9_1 ) there exists an S(e q ) G 

7^' a9 ' F| (F) such that e q -»■ S(e«). 

Indeed, given e g_1 G V q _i(X, F), by (6*) there exists 

{^/(e"- 1 ) CF : J C [q - 1]} = ^(e 9 - 1 ) G J^r^'OO 

with e 9_1 —7- S^e 9-1 ). In particular, note that by definition of (F), the sets 

Si(e q ~ 1 ) are disjoint, and that \Si(e q ~ 1 )\ = a q -i\Y\ for every I C [q — 1]. Let R = 
Y\\Jj Si(e q ~ 1 ), and recall that q ^ £, and that there is no (p, C, e, /3)-booster of (X, F) for 
any p ^ 2f Thus the partition S(e q ~ 1 ) U {i?} of F does not induce a (p, C, e, /3)-booster 
of (X,F). 

By our choice of (3, we have /3 ^ ccg-i, and thus ^/(e 9 " 1 )! ^ /3|F| for all / C [q — 1]. 
Similarly, since 2 q ~ 1 a q ^ 1 = 2 g ~ 1 (c//4)^ 1 ^ 1/2 < 1 - 0, we have \R\ > (3\Y\. Let X' C X 
be the set of vertices which are not e-boosted by any of the sets S(e q ~ 1 ) U {R}. Since 
S(e q ^) U {R} does not induce a (2 q ~ l + 1, C, e, ^-booster of (X,Y), the graph GLY'] 
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is not C-degenerate, and hence there exists a set of edges /^(e 9 " 1 ) C E(X') such that 
<5(-D(e 9-1 )) > C. We claim that this is the set we are looking for. 

In order to verify this, let e q = xy G D(e q ~ l ) be arbitrary. Our task is to show that 
there exists S(e q ) G jr^> a v\ Y \ such that e 9 — > S(e q ). Recall that x and y are not e-boosted 
by 5(e"~ 1 ) U {R}. Hence d(x, U) < (1 + e)d(x, Y) for each U G S^' 1 ) U {R}, and so, for 
every I C [g — 1], 

e(x,5 / (e^ 1 )) = e(x,F)-e(a;,F\ 1 S / (e^ 1 )) 
(5) ^ e(x, Y)-(l + e) (l - (f) 9 " 1 ) e(x, F) ^ ~ ? ' e(x, F) , 

where we used |SV(e 9_1 )| = a 9 _i|F| = 1 \Y\ for the first inequality, and e = |(|) for 

the second. By the same argument, y has at least | (I) 9 1 e(?/, F) neighbours in Si(e q ~ 1 ) 
for each / C [q — 1] . 

Define, for each I C [g], the set yl/(e 9 ) C F as follows: 

A/(e'):=iV(x)n% ?} (e^) if g G / 
A/(e 9 ) := N(y) n ^/(e 9 " 1 ) if g ^ I. 

Since e(x, F), e(y, F) ^ <i|F|, we conclude from (5), that we have |A/(e 9 )| ^ 2 |F| for 
every i" C [g]. Moreover, the sets Aj{e q ) and Aj{e q ) are disjoint unless / \ {g} = J \ {g}. 
Hence we may choose disjoint sets S 7 (e 9 ) C A 7 (e 9 ) with |S/(e 9 )| = (f) 9 |F| for each / C [g]. 

Let S{e q ) = {Si(e q ) : I C [g]|. We claim that this is the desired family; that is, that 
S(e q ) G J^' aq[Yl {Y) and e q S(e q ). Indeed, the sets S^e") are disjoint, and 

( i\ \Y\ = a q \Y\ 



for each / C [g], by construction. Finally, we prove that e q — > S(e q ), i.e., that — >i S(e q 
for each i G [g]. For % ^ g — 1, this follows because e 9_1 — >■ S^e 9 ^ 1 ), and 



for every / C [g — 1] by (6). For % — q, it follows since Si(e q ) C A^(x) if g G / C [g] and 
^(e 9 ) C N(y) if g G" / C [g] by (6). Hence e 9 — >• 5(e 9 ), as required. This completes the 
induction step, and hence the proof of the lemma. □ 

We can now easily deduce Lemma 30. 

Proof of Lemma 30. By Lemma 32 (applied with g = £), it suffices to show that if (X, F) 
is (C, ae, £)-Zykov, then it is (C, a)-rich in copies of Zf f , where ag = (|) and a = (^f) 2 *■ 
In other words, we want to prove that if there exists S G J-^ 3 ' Qf ' y '(F) with — > S, then 

|{S"e Ff\Y) : e e ^S'}\ ^ a\Y\ s , 
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where s = 2H. Indeed, this is true because \Si\ = otg\Y\ for i C [£], and the number of 
ways of choosing, for each / C [£], a t-subset of Si is 



n 



as claimed. □ 
It is now straightforward to prove Proposition 26. 

Proof of Proposition 26. Let C,£,t G N and d > 0, and set f3 = (f) e and e = (3/2. Let G 
and (X,Y) be as described in the statement, so \N(x) f\Y\ ^ d\Y\ for every x G X. By 
Lemma 29 there exists a (2 e , C, e, /3)-booster tree for (X, Y), and moreover |T| is bounded 
as a function of d, e and £. 

Recall that the leaves of T correspond to a partition of X (and a partition of Y). If 
every leaf (X', Y') of T is degenerate then x(G[X]) ^ |T|(C + 1) =: C, where C depends 
only upon C, £ and d. So we may assume that some leaf (X', Y') G V(T) is not degenerate. 

By the definition of a (2 e , C, e, /3)-booster tree, it follows that there is no (p, C, e, (3)- 
booster of (X',Y') for any p 2 l . Further, \N(x) PI F'| = d(x,Y')\Y'\ > d|y'| for every 
x G X' by Lemma 29. Then, by Lemma 30 (applied with £, t and d), (X', Y') is (C, a')-rich 
in copies of Z £ ' , for some a' = a'(d, £, t) > 0. Since (again by Lemma 29) \Y'\^ /3l r l|F|,it 
follows that (X, Y) is (C, a)-rich in copies of Z^'*, where a = a'(3^ s is a constant depending 
only on d, £ and t, as required. □ 

7. The proof of Theorem 2 

In this section we shall complete the proof of Theorem 2. As a warm-up, we begin with 
the case r = 3, which is an almost immediate consequence of the results of the last four 
sections. 

The following theorem proves Conjecture 1. The proof does not use the Regularity 
Lemma; it follows from Propositions 22 and 26. 

Theorem 33. If H is a near-acyclic graph, then S X (H) = 0. 

Proof. Let if be a near-acyclic graph (so in particular x{H) = 3), let 7 > be arbitrary, 
and let G be an if- free graph on n vertices, with 5(G) ^ 2 / -/n. We shall prove that the 
chromatic number of G is at most C', for some C' = C'(H, 7). 

First, using Observation 20, choose t G N and a collection T±, . . . ,T# of trees such that 
H C Z^'*(Ti, . . . , Tf ). Choose a maximal bipartition (X, Y) of G, assume without loss of 
generality that x(G[X}) ^ x(G f [l r ]), and note that \N(x) PI Y\ ^ ^y\Y\ for every x G X. 

Let a > be given by Proposition 26 (applied with £, t and 7), let C := 2 e+3 a~ 1 Yl\=\ 1^1 j 
and apply Proposition 26. We obtain a C" = C'(H,j) > such that either ^ 
2x(G[X)) ^ 2C\ or (XX) is (C, a)-rich in copies of Zf . 

In the former case we are done, and so let us assume the latter. By Proposition 22 
and our choice of C, it follows that Z^ ,t: (Ti, . . . ,Te) ^= G. But then H C G, which is a 
contradiction. Thus x(G) is bounded, as claimed. □ 
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The case r = 3 of Theorem 2 now follows from Proposition 5, and Theorems 7, 14 and 33. 

Proof of the case r = 3 of Theorem 2. Let if be a graph with x{H) = 3, and recall that 
M.[H) denotes the decomposition family of H . By Proposition 5, if M.(H) does not contain 
a forest then 5 X (H) = ~, and by Theorem 7, if Ai(H) does contain a forest then S X (H) ^ |. 

Now, by Theorem 14, if ff is not near-acyclic then S X (H) ^ |, and by Theorem 33, if if 
is near-acyclic then S X (H) = 0. Thus 

5 X (H) G {0, 1/3, 1/2}, 

where S X (H) ^ \ if and only if if has a forest in its decomposition family, and S X (H) = 
if and only if if is near-acyclic, as required. □ 

The rest of this section is devoted to the proof of the following theorem, which generalises 
Theorem 33 to arbitrary r ^ 3. 

Theorem 34. Let H be a graph with x{H) = r ^ 3. If if is r -near- acyclic, then 

We begin with the lower bound, which follows by essentially the same construction as 
in Proposition 5. 

Proposition 35. For any graph if with x{H) = r ^ 3, we have S X (H) ^ ^5§. 

Proof. We claim that, for any such if, and C, there exist if-free graphs on n ) n 
vertices, with minimum degree ^5§n, and chromatic number at least C. Recall that we 
call a graph a (k, £)-Erd6s graph if it has chromatic number at least k and girth at least £, 
and that such graphs exist for every k, i G N. 

Let G' be a (C, \H\ + 1)-Erd6s graph on at least no vertices, and let G be the graph 
obtained from the complete, balanced (r — 2)-partite graph on (r — 2)|G'| vertices by 
replacing one of its partition classes with G' . Then G is if-free, since every | if |- vertex 
subgraph of G has chromatic number at most r — 1. Moreover, 5(G) = ^E^n and x(G) ^ C> 
as required. □ 

We now sketch the proof of the upper bound of Theorem 34. Let G be an n- vertex, if -free 
graph with minimum degree (f^E§ + 3 / y)ri. Let Ti, . . . , Tg be such that if C Z£'*(Ti, . . . , T e ). 
First, we take an (e, <i)-regular partition, using the degree form of the Regularity Lemma 
(where e and d will be chosen sufficiently small given 7). We then construct a second 
partition V of V(G), similar to that used in the proof of Theorem 7. Our aim is to show 
that x(G[X}) ^ C for each X G V. 

In the next step, we observe that the minimum degree condition guarantees that for 
each X G V, there are clusters Y and Z 1; . . . , Z r _ 3 of the (e, <i)-regular partition with the 
following properties. First, for each v G X we have dy(v) ^ j\Y\, and for each % G [r — 3] 
we have dz^v) ^ (| + 7) |Zj|. Second, F, Z 1; . . . , Z r _ 3 forms a clique in the reduced graph 
of the (e, (i)-regular partition. 
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Now recall that Zj'*(Ti, . . . ,Te) contains independent sets Sj for each I C [£], and inde- 
pendent sets Si for each i G [r — 3]. The idea now is to show that if x(G[X]) ^ C does 
not hold, then we find a copy of Z]"'*(Ti, . . . , Tg) in which the trees 7\, . . . , Ti lie in X, the 
independent sets Sj lie in Y, and 5j lies in Zj for each % G [r — 3], which contradicts the 
assumption that G is H-free. 

In order to achieve this, we work as follows. We apply the paired VC-dimension argument 
(Proposition 26) to (X,Y), with constants £* and t* which are much larger than £ and t, 
and a very large C*. This yields our C and an a > such that either x(G[X]) ^ C (in 
which case we are done), or (X, Y) is (C*, a)-rich in copies of Z^** . 

In the latter case, we apply Lemma 24 to conclude that (X, Y) is (C*, a)-dense in copies 
of Zff . The main work of this section then is to show (in Proposition 36) that this 
implies that there is an S G T[' (Y U Z\ U • ■ • U Z r _ 3 ) such that 5 is (r, £, t, C, a)-good 
for (X, Y U Zx U ■ • • U Z r _3). Finally, applying Lemma 25 we find that there is a copy of 
Z['*(7i, . . .,7>) in G. 

As just explained, the following proposition is the main missing tool for the proof of 
Theorem 34. 

Proposition 36. For every r > 3, £, t G N and d, 7 > there exist £*, t* G N such that for 
every a > and CeN, t/iere exzsi £1 > and C* G N, such that for every < e < e\ the 
following holds. 

Let G be a graph, and let X, Y and Zi,...,Z r _3 be disjoint subsets ofV(G), with 
\Y\ = \Zj\ for each j £ [r-3]. Let Z :— Z\ U • • • U Z r „3. Suppose that (Y, Zj) and (Zj, Zj) 
are (e, d) -regular for each i 7^ j ; ana 1 £na£ 

ix^nz,-!^ Q + 7)1^1 

/or every x G X ana 1 j G [r — 3] . 

// (X, y) is (C* , a) -dense in copies of Z\? , then there is some S G T'^iY U Z) siic/i 
iaai 5 is (r, t, C, a)-good for (X, Y U Z). 

For the proof of this proposition, we combine an application of the Counting Lemma 
and two uses of the pigeonhole principle. As a preparation for these steps we need to 
show that there exists a family S* G J 7 ^'* which is (3, £*, t*, C*, a)-good for (X, Y) and 
'well-behaved' in the following sense. For each of the sets Sj C Y given by Sj only a small 
positive fraction of the (r — 3)t-element sets in Z has a common neighbourhood in S} of 
less than t vertices. To this end we shall use the following lemma. 

Recall that for a set T of vertices in a graph G, we write 



N(T): = f]N(x). 



Lemma 37. For all r,t G N and fi, d > 0, there exist t* = t*(r,t, fi,d) G N and Eq 
Eo(r, t, fi, d) > such that for all < e < eq the following holds. 
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Let G be a graph, and suppose that Y and Z±, . . . , Z r _ 3 are disjoint subsets ofV(G) such 
that (Y, Zj) is (e, d) -regular for each j G [r-3]. Let Z :— Z\ U . . . U Z r _ 3 , and define 

for each S C y . Then we have 

S:={Se(J)j : \B(S)\ ^y\Z\^) < y/i\Yf. 

Proof. Choose t* sufficiently large such that 

(7) P(Bin(t*, {d/2f-^) <t\^n, 

where Bin(n,p) denotes a random variable with binomial distribution, and set 

(8) £ :=min{(^(r-2<>-3))- 2 }. 

In the first part of the proof we shall construct a family S' of at least O^J) — -^/e] V|** 
sets S G In the second part we will then show that S' C (J^) \ S, which proves 

the lemma. For constructing the sets S G S' we proceed inductively and shall choose the 
vertices V\, . . . , v t * of S one by one, in each step k G [t*] avoiding a set Yk C Y of size 
at most e2 k (r — 3)|F|. Clearly, by (8), this gives at least OYJ) — v^l^l** choices for S as 
desired. 

Indeed, suppose we have already chosen the vertices v±, . . . , v^-i- In addition we have 
chosen for each j G [r — 3] a partition P. 1 of with the following property (we shall 
make use of these partitions in part two of the proof): for each / C {m, . . . , Vk-i} we have 
chosen a part P/ _1 (/) of size (d - e) |7| (l - d + e) k - l ~^\Zj\ such that P/ _1 (/) C N(I). 
Now we will explain how Vk can be chosen together with partitions P- satisfying the above 
conditions. For this purpose consider the set C 7 of vertices y such that for some 
j G [r — 3] and some J C {v\, . . . , f we have 

\N{y)nPf- 1 (r)\<(d-e) k \Z j \, 

where P? := {Zj} is the trivial partition of Zj. The possible choices for Vj. now are the 
vertices in Y \ Yj.. The partitions P^ with j G [r — 3] are defined as follows. For each /' C 
{vi, . . . , Vk-i} we choose an arbitrary subset P of N(vk) nPj c_1 (J) with |P| = (d — e) k \Zj\, 
which is possible by the choice of Vk, and set 

P*(I'):=P*-\I')\P and Pf(P U {z k }) := P . 

Clearly, the partitions defined in this way satisfy that each part PHl) is of size (d— e)' 7 '(l — 
d + £)*H 7 l|Zj| and that P*(I) C N{I) as desired. 

It remains to show that \Yk\ ^ e2 k (r — 3)|Y| as claimed above. If this is not true, then 
for some j G [r — 3] and J C [A; — 1], there exist £|Y| vertices in Y which have at most 
(d-e)\P k (I)\ neighbours in P/ _1 (I). Since |P/ _1 (/)| > {d-ef- l \Z j \ ^ e\Z-\ by (8), this 
contradicts (e, d)-regularity of (Y, Zj). 
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We now turn to the second part of the proof: We claim that for every S G S' we have 
^(S 1 )! < yu|Z|( r_3 )*. To see this, simply choose a random multiset T C Z of size (r — 3)t, 
and observe that N(T) D S is given by the intersection of (r — 3)t sets Si, ... , S( r _3) t C S 
chosen (independently) according to the distribution 

By construction we have |P**(/)| = (d — e)' 7 '(l — d + e)'* - ' 7 ' for every j G [r — 3] and 
ICS. Hence 

P(JCg,)> lUj^U/grggff^Ol = ^ (d- £ )l J 'l(l-d + £ f-|/'l = (d _ £) m. 

' ' icrcs 

This implies that for every I C S, we have P(J C Si) ^ P(J C S]) for the random 
variable 5*- with the following distribution: for every u G S we take w G Sj- independently 
with probability d — e. We conclude that 

P(|Si n . . . n S (r ._ 3 ) t | ^t) = P(J C Si n • • • n S (r _ 3)t for some ICS with |/| ^ t) 

^ P(/ C S[ n ■ • • n 5j r _ 3) i for some ICS with |/| ^ t) 

= ¥(\S[ n . . . n S( r „ 3)t | > t) = p(Bin(r, (d - e) (r - 3)t ) ^ t) 

where the last inequality follows from (7). This proves |£(S)| < /i|Z|( r - 3) * and hence 
finishes the proof of the lemma. □ 

We shall now prove Proposition 36. 

Proof of Proposition 36. We start by defining the constants. Given r > 3, £, t G N and 
7, d > 0, we set 

(9) ^:=-77 ?w w-^rl-V and ^*:=— • 

V ; P 8((r-3)t)!(r-3)( r " 3 )* V2/ 2/i 

Let t* and £o be given by Lemma 37 with input r, t, fi' := 2 _r d. Given a > and C, we 
choose 

/a 2 d 7 \ 2 r C 

(10) £l :=min l2^' 4( 7 + l)(r-3)t ' £ °J &nd ° := ^7- 

Now let < e < £i, let G be a graph, and let X, Y and Z x , . . . , Z r _ 3 be disjoint subsets 
of V(Cr) as described in the statement, so in particular, (X, Y) is (C*, a)-dense in copies of 
. The goal is to show that there exists S G F/iYUZ) such that 5 is (r, £, t, C, a)-good 
for (X,YUZ). 

Our first step is to show that there is a 'well-behaved' function S* G T t * (Y). 
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Claim 38. There is a function S* G J>'** (Y) which is (3,£*,t*, C*, a)-good for (X,Y) and 
has the property that for every / C [£*], the set 

B(S*j) = {t g ( f ) : |iV(T) n S 7 *| < t] 

in (/ r _ 3 u) has size at most 2 _r yu|Z|( r-3 ) t . 

Proof of Claim 38. By Lemma 37 (with input r, i, // = 2 _r /i, d), the total number of 'bad' 
t*-subsets S" of F, i.e., those for which B(S') ^ 2~ p /2\Z\^\ is at most y/e\Yf . Let 
S be the set of functions S* in J 7 ^** (V) which do not have the property that for every 
I C [£*] we have B(5'|) < 2^*/i|Z|( r ~ 3 )'. We can obtain any function S* in 5 by taking a 
set J C [£*] and one of the at most -y/e|^|** 'bad' t*-sets to be S}, and choosing the 2 r — 1 
remaining sets of S* in any way from ( „) . It follows that 



\S\ ^ 2 e *V^\Yf\Y\^- 1 ^ = 2 r ^\Y 



2 l t* 



Since (X, Y) is (C*, a)-dense in copies of Z\i , there are at least 2 e *a\Y \ 2<! ** functions in 
Fff{Y) which are (3,f, t*, C*, a)-good for (X,F). Since by (10) we have 2" r a > 2^*^, 
at least one of these functions is not in S, as required. n 

For the remainder of the proof, S* will be a fixed function satisfying the conclusion of 
Claim 38. Since S* is (3, £*, t*, C*, a)-good for (X,Y), there exist sets 

£*, ...,E* e ,C E(X), with d(E*) ^ 2- r 'aC* for each 1 ^ j < t , 

such that for every ei G . . . , e^* G we have e r — >■ 5*. 

Our next claim comprises two applications of the pigeonhole principle to find a copy of 
K r -z(t) in Z. 

Claim 39. There exists a copy T of i^ r _3(t) with t vertices in Zj for each j G [r — 3], and 
a set L C [£*] of size |L| = £ such that: 

(i) |JV(T) n 5 7 * I ^ t for every / C [F], 

(m) iV(T) contains at least fi\E*\ edges of E*, for each j G E. 
Proof of Claim 39. By assumption, for every x G X and j G [r — 3] we have 

\N{x)nz j \ >(\ + J\ \Zj\, 



and so each edge e G E* U . . . U E|* has at least 7] Zj | common neighbours in Zj . By 
Fact 3, the common neighbours of e in Z, and Zj form an (e/7, d — e)-regular pair for each 
1 ^ « < j ' ^ r — 3. By (10) we have d— e — (r — 3)te/7 > d/2. Hence, applying the Counting 
Lemma with d replaced by c? — 5 and 5 replaced by 5/7 to the graph Ef = E/ r _3(t), it follows 
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that there are at least 



Ant(H)\ 7 1 7 Vr- 



((r-3)«)!V2. 

copies of K r _ 3 (t) in iV(e) D Z, each with i vertices in each Zj. 

There are therefore, for each j G [£*], at least &ii\Z\( r ~^ t \E*\ pairs (e,T), where e G E* 
and T is a copy of K r -z(t) as described, such that T C iV(e), or equivalently e C N(T). 
Since we have 

8/x|^] (T — 3) *[£7J| =4/i|Z| (r - 3)t |E;| +4 y u|E;||Z| (r - 3)i , 

by the pigeonhole principle, it follows that there are at least 4/x|Z| (r ~ 3 ) < copies of K r - 3 (t) 
in Z each of which has at least 4fi\E*\ edges of E* in its common neighbourhood. Let 
us denote by Tj the collection of such copies of K r - 3 {t). For a copy T of K r - 3 (t), let 
L(T) = {j : T G Tj}. 

We claim that there is a set T containing at least 2/z|Z| (r ~ 3 ) i copies T of K r ^(t) in Z, 
each with |L(T)| ^ £. Indeed, this follows once again by the pigeonhole principle, since 
there are at least 

t ■ 4/i|Z| (r - 3) * = £\Z\ (r - 3)t + t ■ 2fi\Z\ {r ~ 3)t 

pairs (T, j) with T eTj. 

Now, recall that S* satisfies the conclusion of Claim 38, i.e., for each I C [£*], there are 
at most 2- r /i|Z|( r - 3 )* sets T G ( (r f 3)t ) such that |iV(T)n£?| ^ f. Since |T| ^ 2^\Z\^\ 
there is a copy T of K r ^ 3 (t) G T such that for each 7 C [£*), we have |JV(T) n Sf\ ^ t. If 
we let L be any subset of L(T) of size £, then T and L satisfy the conclusions of the claim. 

□ 

Let T and L be as given by Claim 39 and for each j 6 I let Ej C X be a set of | 
edges of £^ contained in N(T) as promised by Claim 39(m). We construct a function 
S G by choosing, for each I C L, a subset S7 C of size i in JV(T) D F (which is 

possible by Claim 39(i)), and letting the sets Si, i G [r — 3], be the parts of T. 

Claim 40. 5 is (r, £, t, C, a)-good for (X, YUZ). 

Proof of Claim 40. Recall that \L\ = £, and assume without loss of generality that L = 
{1, . . . ,£}. By the choice of T and the definition of the sets Sj with JCL and the sets Si 
with i G [r — 3], we have that is completely adjacent to each SV with % ^ i', to each Si, 
and to each edge e G Ujez Since e r — > £* for each e e * G -B* x . . . x it follows that 
— > S for each e G E% x . . . x Eg. Finally, for each j G L, since ^ A*|£j|> we have 



as required. 



THE CHROMATIC THRESHOLDS OF GRAPHS 



31 



Thus there exists a function S G which is (r, £, t, C, a)-good for (X, Y U Z), as 

required. □ 

Remark 41. It is possible to strengthen the conclusion of Proposition 36: under the 
same conditions, (X, Y U Z\ U • • • U Z r _ 3 ) is (C, a')-dense in copies of for some a' = 
a'(r, £, t, d, 7, a) > 0. To see this, observe that the proofs of Claims 38 and 39 both in fact 
yield a positive density of functions S* in F^f (Y) and of copies T of K r s(t), respectively. 
From any such S* and T can be obtained a function S which is (r, £, t, C, a)-good for 
(X,YUZ). 

We can now deduce Theorem 34. 

Proof of Theorem 34- The lower bound is given by Proposition 35, so we are only required 
to prove the upper bound. Let H be an r-near-acyclic graph, with r ^ 4, and let 7 > 0. 
Because H is r-near-acyclic, by Observation 20 there exist trees Ti, . . . ,7} and a number 
t G N such that H C ^''(Ti, . . . , T<). We now set constants as follows. First, we choose 
d = 7. Given r, i, d and 7, Proposition 36 returns integers and i*. Now Proposition 26, 
with input £*,t* and d, returns a > 0. Next, consistent with Lemma 25 we set C := 
2^ +3 a _1 Yli=i Feeding a and C into Proposition 36 yields E\ > and C*. Putting C* 
into Proposition 26 yields a constant C". We choose 

(11) ko := 2r/7 and £ := min(e 1 , 7) . 

Finally, from the minimum degree form of the Szemeredi Regularity Lemma, with input e, 
d, 5 = (^zf + 37) and ko, we obtain a constant k\. 

Let G be an if- free graph on n > k\ vertices, with 5(G) ^ (^5§ + 37) n. We shall 
prove that x(G) ^ 2 ■ 2 2kl C. First, applying the minimum degree form of the Szemeredi 
Regularity Lemma, we obtain a partition Vo U . . . U Vk of V(G), with reduced graph R, 
where 5(R) ^ (^5§ + 7) k. We form a second partition by setting 

X(h,h) := G : ieh & \N(v) n K| ^ 71^1 

and z'Gl 2 ^ |7V(u)nVi| > Q + 7jl^l 

for each pair of sets I 2 C ii C [fc] . It obviously suffices to establish that for each ix and 7 2 
we have x(G[X(h, I 2 )}) < 2C 

Hence let I 2 Q h Q [k] be fixed. Since x{G[X(h, h)}) < 2C" is obvious when X(I U I 2 ) 
is empty, assume it is non-empty. Then the minimum degree condition on G allows us to 
establish the following claim. 

Claim 42. There exist distinct clusters Y, Y' G l\ and Z\, Z[, . . . , Z r ^, Z' r _ 3 G I 2 such that 
(Y, Zi), (Y f , Z'i), (Zi, Zj) and (Z' {) Zj) are (e, <i)-regular for every pair {i, j} C [r — 3]. 

Proof of Claim 42. Let x be any vertex in X(7i, 12), and let m — \V\\ — ■ ■ ■ — \Vk\- By the 
definition of X(I\, I 2 ), we have \N(x) D Vi\ ^ 7m iff z G Ji, and thus 

(;=§ + 3 7)^ < < ^ eri + (A; - |ii|)7m + (i^m < (e + 7)71 + |/i|f . 
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Since by (11) we have e < 7, we deduce ^ | + j)k. Similarly, we have \N{x) C\Vi\ ^ 
{\ + 7)m iff i 6 J2 and therefore 

(^zf + 37)^ < d(x) ^en+ (k - \I 2 \) (\ + 7 )m + \I 2 \m < (e + \ + j)n + | J 2 |§ , 

from which we obtain |J 2 | ^ + 7)^. 

Since 5(R) ^ (jj^f + eacn cluster in R has at most ~ l)k non-neighbours. It 
follows that 

8{R[I 2 ]) > |/ 2 |--A_ + 7 fc^ (r=| + 7 ) |/ 2 |, 

so by Turan's theorem, i?[/ 2 ] contains a copy of K r _ 3 . We let its clusters be Z 1; . . . , Z r _ 3 . 
Since each Zj is non-adjacent to at most (-3= cluster in J 1; there is a cluster Y in Jx 

adjacent in R to each Zj with i G [r — 3]. Since k ^ /c 0; by (11) we have 7/c — (r — 2) ^ 7^/2 
and therefore 

8{R[I 2 \ {Y, Z u ..., Z r ._ 3 }]) Z \h\ - ^ + h k > + 57) |^2 \ {y, Z 1 ,..., Z r „ 3 }| • 

Thus we can again apply Turan's theorem to R[I 2 \ {Y, Z x , . . . , Z r __ 3 }] to obtain a clique 
Z[, . . . , Z',_ 3 in J 2 , which has a common neighbour Y' G i~i \ {Y, Zi, . . . , Z r _ 3 }, as required. 

u 

Let y, y' G /1 and Zi, Z{, . . . , Z r _ 3 , Z^._ 3 C I 2 be the clusters given by Claim 42. Let X = 
X(I 1} I 2 )n(Y'UZ[U- ■ -UZ' r _ 3 ), and X' = X(I 1: h)\X. Observe that X, Y, Z u . . . , Z r _ 3 are 
pairwise disjoint (as are X', Y', Z[, . . . , Z^,_ 3 ). Our goal now is to show that x{G[X]) ^ C. 
Since an analogous argument gives x((jr[X']) ^ C and we have X(Ii, I 2 ) = X U X', this 
will imply x(G[X(ii, J 2 )]) ^ 2C', and thus complete the proof. 

We apply Proposition 26, with input £*, t*, d and C*, to (X, Y). Observe that, since 
Y G h and X C X(Ii,I 2 ), we have |X(x) D Y\ ^ for each x e X. Recall that a 
and C were defined such that the conclusion of Proposition 26 is the following. Either 
x(G[X]) ^ C, or (X, Y) is (C*, a)-rich in copies of Zj»* . In the first case we are done, so 
we assume the latter. We will show that this contradicts our assumption that G is H-free. 

By Lemma 24 the pair (X, Y) is (C*, a)-dense in copies of ZpT . We now apply Propo- 
sition 36, with input r, £, t, d, 7, a, C, and e to X, Y, Zi, . . . , Z r _ 3 . Observe that since 
Z\i • • • , Z r _ 3 G I 2 , we have |X(x) fl Zj| ^ (| + 7)|Z i | for each a; G X and z G [r — 3]. 
Moreover, by Claim 42, any pair of Y, Zi, . . . , Z r _ 3 is (e, d)-regular. Recall that £*, t*, E\ 
and C* were defined such that the conclusion of Proposition 36 is that there exists a func- 
tion S G J^'*(Y U Z l U • ■ • U Z r _ 3 ) which is (r, £, t, C, a)-good for (X, Y U Z x U • • ■ U Z r _ 3 ). 
Finally, we apply Lemma 25, with input r, £, t, a and Ti, . . . , T#, to X and YUZiU- ■ -UZ r _ 3 . 
By the definition of C, this lemma gives that H C Zp*(Ti, . . . ,T^) is contained in G, a 
contradiction. □ 

Finally, we put the pieces together and complete the proof of Theorem 2. 

Proof of Theorem 2. Let H be a graph with x{H) = r ^ 3, and recall that M.(H) denotes 
the decomposition family of H. By Proposition 5, if M.(H) does not contain a forest then 
S X (H) = ^5? , an d by Theorem 7, if A4(H) does contain a forest then 8 X (H) ^ |^5§ • 
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Now, by Theorem 16, if H is not r-near-acyclic then 5 X (H) ^ , and by Theorem 34, 
if H is r-near-acyclic then 5 X (H) = Thus 

* ,tt\ fr-3 2r-5 r-2] 

where 5 X (H) ^ ^5? if and only if if has a forest in its decomposition family, and 5 X (H) = 
if and only if H is r-near-acyclic, as required. □ 

8. Open questions 

Although we have determined S X (H) for every graph H, there are still many important 
questions left unresolved. In this section we shall discuss some of these. We begin by 
conjecturing that the assumption on the minimum degree can be weakened to force the 
boundedness of the chromatic number, as Brandt and Thomasse [8] proved in the case of 
the triangle. 

Conjecture 43. For every graph H with 5 X (H) = X(H), there exists a constant C(H) 
such that the following holds. If G is an H-free graph on n vertices and 6(G) > X(H)n, 
then x{G) < C(H). 

We mention that an analogous statement is not true for H with S X (H) e {9(H), it (H)} 
as a simple modification of our constructions for Propositions 5 and 35 shows: we merely 
need to make the partite graphs used in these constructions slightly unbalanced and to 
guarantee that the Erdos graphs cover the whole partition class they are pasted into and 
have a sufficient minimum degree. 

For graphs H with S X (H) = one could still ask whether the minimum degree condition 
can be weakened to some function f(n) = o(n). The following well-known fact shows that 
this is not the case. 

Proposition 44. Let H be a graph with x(H) ^ 3, and let f(n) = o(n). For every 
C and n±, there exist H-free graphs G on at least n\ vertices with 5(G) ^ f(v(G)) and 
X(G) > C. 

Proof. Given H, f, C and n\, let Go be a (C,v(H) + 1)-Erd6s graph. Without loss of 
generality, we may assume 5 (Go) ^ 1. Let n be such that f(n) ^ u/v(Gq) for each 
n ^ no- Let G be obtained from Go by blowing up each vertex to a set of size max(no, n\). 
Then G has at least n\ vertices, and we have 5(G) ^ v(G)/v(G ) ^ f(v(G)). Since 
Go contains no cycle on v(H) or fewer vertices, G contains no odd cycle with v(H) or 
fewer vertices. In particular, every v(H)-vertex subgraph of G is bipartite, and hence G is 
H-free. □ 

Proposition 44 also implies that for graphs H with 5 X (H) = the upper bound on x(G) 
for H-free graphs G with 5(G) ^ en increases as e goes to zero. This suggests the following 
problem. Set 

5 x (H,k) := inf {d : 5(G) ^ d\G\ and H£G X (G) ^ fcj , 
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or, equivalently, 

Xs{H,d) := max|x(G) : 5(G) ^ d\G\ and H % g\ , 
and call this the chromatic profile of H . 

Problem 45. Determine the chromatic profile for every graph H . 

As noted in the Introduction, we have, by the results of Andrasfai, Erdos and Sos [3], 
Brandt and Thomasse [8], Haggkvist [18] and Jin [20], that 

5 X (K 3 ,2) = | 5 X (K 3 ,3) = ^ and 5 x (K 3 ,k) = 1 for every k ^ 4. 

We remark that this problem was also asked by Erdos and Simonovits [14], who remarked 
that it seemed (in full generality) 'too complicated' to study; despite the progress made 
in recent years, we still expect it to be extremely difficult. Note that although our results 
give explicit upper bounds on xs(H,d) for every graph H, even in the case 5 X (H) = 0, 
where we do not use the Szemeredi Regularity Lemma, these bounds are very weak. 

Luczak and Thomasse [30] suggested the following more general problem. Given a (with- 
out loss of generality monotone) family J 7 of graphs, we define 

5 X (J-") := inf 1 5 : 3 C = C^J 7 , 5) such that if G G T is a graph on n vertices 

with 5(G) ^ 5n, then X (G) ^ c\ . 

Problem 46. What values can 5 X (F) take? 

Our results settle this question completely when J 7 is defined by finitely many minimal 
forbidden subgraphs (in which case 5 X (F) is precisely the minimum of 5 X (H) over all 
minimal forbidden subgraphs H). For families T defined by infinitely many forbidden 
subgraphs, however, this minimum provides only an upper bound on 5 X (J : ). 

Luczak and Thomasse [30] suggested in particular to determine 5 X (B), where B is the 
family of graphs G such that for every vertex v 6 G, the graph G[iV(w)] is bipartite (as a 
natural generalisation of the family of triangle- free graphs, in which every neighbourhood is 
an independent set). This family is indeed defined by infinitely many forbidden subgraphs: 
to be precise, by the odd wheels. Luczak and Thomasse gave a construction showing that 
5 X (B) ^ |, and conjectured that 5 X (B) = |. Since the wheel W 5 (i.e., the graph obtained 
from C5 by adding a vertex adjacent to all its vertices) is a forbidden graph for B, and 
5 X (W§) = 5 by Theorem 2, our results confirm that their conjecture is true. 

One can generalise the concept of chromatic threshold to uniform hypergraphs. Recently, 
Balogh, Butterfield, Hu, Lenz and Mubayi [4] extended the Luczak-Thomasse method to 
uniform hypergraphs "H, and thereby proved that 5 X (T-L) = for a large family of such %. 
To quote from their paper, 'Many open problems remain'. 

Finally, we would like to introduce a new class of problems relating to the chromatic 
threshold. There has been a recent trend in Combinatorics towards proving 'random 
analogues' of extremal results in Graph Theory and Additive Number Theory (see, for 
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example, the recent breakthroughs of Conlon and Gowers [10] and Schacht [36]). We 
propose the following variation on this theme: for each graph H and every function p = 
p(n) G [0, 1], define 

5 x (H,p) := inf |d : 3C(H,d) such that for G = G UtP , asymptotically almost surely, 

if G' C G, 6{G') ^ dpn and H £ G', then X (G') < C(#,d)}, 

where G n ^ p is the Erdos-Renyi random graph. Note that when p{n) = 1, we recover the 
definition of 6 X (H). 

Problem 47. Determine S x (H,p) for every graph H , and every p = p(n). 

In a forthcoming paper [2] we intend to show that for every constant p > and every 
graph H, we have 5 X (H) = S x (H,p). This is of course trivial in the case S X (H) = 0, when 
it follows from the results of this paper together with the well-known fact that for constant 
p, the minimum degree of G n>p is asymptotically almost surely at least pn/2. In the case 
5 X (H) > 0, the result is not trivial: but much of the machinery developed in this paper 
can be used unchanged. The following construction shows that the result is best possible, 
in the sense that it fails to hold for p = o(l). 

Theorem 48. Let r ^ 3 and C G N, and let H be a graph with x{H) = r and S X (H) ^ 
X(H) = 2^E§ ■ If £ > is sufficiently small, the following holds. If n~ e < p < e 2 , then 
asymptotically almost surely the graph G = G n ^ p contains an H-free subgraph G' with 
X(G') ^ C and S(G') ^ (1 - e)^pn. 

Proof (sketch). Given r, H and C, we let F be a fixed (C,v(H) + 1)-Erd6s graph. We 
choose a sufficiently small e > 0. 

We now construct an H-free subgraph G' of G — G n , v as follows. Let Vi, . . . ,V r -i 
be an arbitrary balanced partition of [n\. We fix a copy of F within G[V\] (which exists 
asymptotically almost surely). Then we delete all edges within each part Vi with i G [r — 1], 
except those in the copy of F . Moreover, for each pair of vertices u, v G V(F), we delete 
the edges from u and v to the common neighbours of u and v in each of V2, ■ ■ ■ , V r -\- 

It follows that x{G') ^ C and that G' asymptotically almost surely has minimum degree 
(1 — s)jz^pn. In addition it can easily be checked from our characterisation of graphs H 
with 8 X {H) > X(H) that G' is also H-free. □ 

Theorem 48 can be significantly strengthened, and we intend to do so in [2]. However, 
results of Kohayakawa, Rodl and Schacht [22] show that we cannot increase the value 
in the minimum degree, i.e, S x (H,p) ^ Thus, by Theorem 48, and in contrast to 

the p = 6(1) case, if 5 X (H) ^ A (if) and n~°^ <p = o(l), then 5 x (H,p) = g = A H )- 
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